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Abstract 

This article deals with optimizing problems classified by the kinds 
of restrictions as required in differential geometry and in mechan¬ 
ics: holonomic and nonholonomic. The central issue relates to dual 
nonholonomic programs (what they mean and how they are solved?) 
when the nonholonomic constraints are given by Pfaff equations. The 
original results are surprising and include aspects derived from the 
Vranceanu theory of nonholonomic manifolds, from the geometric dis¬ 
tributions theory and from Darboux’s theorem on canonical coordi¬ 
nates in which we can express a Pfaff form. On these ways we get 
also an original Riemannian geometry attached to a given constrained 
optimization problem. 
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1 Classical Lagrange and Wolfe 
dual programs 

In this Section, we re-discuss the well-known programs with holonomic con¬ 
straints insisting on the following issues n, a, i: (i) the Lagrange-dual 
problem with weak respectively strong duality; (ii) the Wolfe-dual problem; 
(hi) pertinent examples. 
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1.1 The Lagrange dual problem 

Let D be a domain in i?", let x = be a point in D and f : D R 

and Qa ■ D R he convex functions. Denote g = (go) and we introduce the 
set 

= {x G D \ ga{x) < 0, a = I, = {x G D \ g{x) ^ 0}. 

A complete notation for this set is {VL,g,^), but for short the sign ^ or 
the pair {g, are suppressed in the notation. Let us consider the convex 
program 

(P) min{/(a;) | x G D}. 

X 

The Lagrange function (or Lagrangian) of (P) 


m 

L{x, A) = f{x) + ^ Xa9a{x) = f{x)+ < X, g >, X E D, X ^ 0 

a=l 


is convex in x and linear in A. 

Remark We can create an original Riemannian geometry on the set of 
critical points, using similar ideas we shall develop in a further 3.2.1. 

For all A P 0, the inequality 

m 

y^^Xaga{x) <0, Vx e D 

0=1 


holds. Consequently 

(1) L{x, A) < f{x), 'ix e D, VA P 0. 

The equality holds iff (complementarity conditions) 

Xaga{x) = 0 (for each a = 1,..., m). 

Let us introduce the Lagrange dual function 

m 

fj^X) = inf {f{x) + ^ Xaga{x), X e D, A p 0}. 

x£D 

CL=\ 
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This function iplX) is concave, because it is a point-wise infimum of affine 
functions. Indeed, using the linearity of L{x, A) with respect to A, and intro¬ 
ducing A^ ^ 0, A^ ^ 0, and 0 < t < 1, we have 

ip{tX^ + (1 — t)X'^) = inf L{x, tX^ + (1 — ^)A^) 

xgD 

= inf {tL{x, A^) + (1 — t)L{x, A^)) > inf {tL{x, A^)) -|- inf ((1 — t)L{x, A^)) 

xGD xGD x£D 

= t inf L{x, A^) -|- (1 — t) inf L{x, A^) = tilj{X^) -l- (1 — t)ijj{X^). 

x&D x£D 

Definition 1. The problem 


sup {t/’(A) I A ^ 0} 

A 

is the so-called Lagrange dual problem of (P). 

The Lagrange dual problem can be called convex because it is equivalent 
to the convex problem 

inf {—'^(A) I A ^ 0}. 

A 

The Lagrange-dual problem is also dehned in this way if (P) is not convex. 
The following theorem holds also in that case. 

Theorem 1. (Aveak dnalityj The dual function yields lower bounds of the 
initial optimal value /*, i.e., for any X, we have t/’(A) < /*. In other words, 


sup {t/’(A) I A ^ 0} < inf {f{x)}. 




Proof In the foregoing statements, we have the relation (1). Since Q G D, 
for each A ^ 0, we hnd 


■^(A) = inf L{x,X) < inf L{x,X) < inf f{x). 

xGD xGQ 

Thus the statement in the theorem is trne. 

The problem of hnding the best lower bonnd on /* obtained from the 
Lagrange dual function is called the Lagrange dual problem for the original 
or primal problem. 

The optimal values may be different. However, they are equal if (P) 
satishes the Slater condition and has hnite optimal value. This is the next 
resnlt. 
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Theorem 2. (strong duality) If the program (P) satisfies the Slater con¬ 
dition and has finite optimal value, then 

sup {'^(A) I A ^ 0} = inf {f{x) \ g{x) ^ 0}. 

A 

Moreover, then the dual optimal value is attained. 

Proof Denote by /* the optimal value of (P). Taking a = /* in the Convex 
Farkas Lemma, it follows that there exists a vector A* = (Ai*,A^*) > 0 
such that 

m 

L{x, K) = f{x) + ^ Aq* ga{x) > /*, Wx e D. 

a=l 

Using the dehnition of this implies '^(A*) > /*. By the weak duality 

theorem, it follows that '^(A*) = /*. This not only proves that the optimal 
values are equal, but also that A* is an optimal solution of the dual problem. 

Remark Unlike in Linear Programming theory, the strong duality theo¬ 
rem cannot always be established for general optimization problems. 

1.2 Topology of Lagrange multipliers 

Let / ; D C i?” —)■ i? and g : D C ^ BP, p < n, oi class with 
rank .Jg = p m. D. Let L{x,\) = f {x) + \ ■ g (x ), with A G R^. We recall 
that 

R (x, A) = V/ (x) + A ■ Vg (x) = 0 

is the equation of critical points with respect to x of the Lagrange function. 

Let A = {x I 3A cu (x, A) = 0} and B = (A | 3a; cu if (x, A) = 0} . 
Introduce h : A ^ B such that H {x, h {x)) = 0. The function h is well 
dehned since the equation H [x, A) = 0 is linear in A (system with unique 
solution). Hence, for any \ E B, the set h~^ (A) is non-void, and it consists of 
all critical points corresponding to A (set in which the nondegenerate critical 
points are isolated). 

Proposition 1. Let Xq E B such that there exists Xq E h~^ (Aq) with the 
property that xq is nondegenerate, i.e., the Hessian df f {xq) + Aq ■ d'^g {xq) is 
nondegenerate. Then h admits a differentiable section saq : I\q -E A. 

Proof. Since ^ (a;o, Aq) = d'^f (a;o) 3- Aq ■ d'^g (a;o) is non-degenerate, by hy¬ 
pothesis, there exists a neighborhood ^Ao of Aq and a differentiable function 
sao : ho a such that H {sxo (A), A) = 0, VA G ho and saq (Aq) = Aq. 
Moreover, the function saq is unique, with these properties. □ 
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For any A G -B, let Sx be the set of all sections of h defined in a neigh¬ 
borhood of A, set which is eventually void. 

Remark 1. (i) If h~^ (A) contains at least one nondgenerate critical point, 
then S\ is non-void. If h~^ (A) does not contain degenerate critical points, 
then the sets h~^ (A) and Sx have the same cardinal and are discrete sets, 
(a) The set C = {A G R I Ba 7 ^ 0} is open. 

In the following, we suppose that the set Sx is hnite, for any A G R. 
We can dehne f* : B ^ R hj f* (A) = max^gs^ / (s (A)) , if Ra 7^ 0 and 
r (A) = -oo, if Ra = 0. 

Proposition 2. (i) For any X ^ B, we have 


f* (A) < sup,,g;,-i(A)/(a;). 

(a) If h~^ (A) does not contain degenerate critical points, then 

f* (A) = sup,,g;,-i(;,) / (x) = niax,,g,,-i(A) / (x). 

Proof, (i) Let sq G Ra cu / (sq (A)) = max^gs^ / (s (A)) = /* (A). Since 
So (A) G h~^ (A), it follows that f* (A) < sup 3 ,g;j-i(-;^) / {x). 

(2) By hypothesis, the sets Sx §i h~^ (A) have the same cardinal, hence 
h~^ (A) is hnite. Let yo G h~^ (A) with / (|/o) = maXj,g^-i(A) / {x). Since 
{yo, A) is a nondegenerate critical point, there exists si G Ra with si (A) = yo- 
Then, it follows that 

max / (s (A)) > / (yo) = max (A) / (x). 

seSx x&h-^ 


□ 

Proposition 3. Let Aq G R such that h~^ (Aq) does not contain degenerate 
critical points. Suppose, also, that f\h-^{Xo) injective. Then there exists 
So G Rao, So : Jo -t a such that f* (A) = / (sq (A)), for any A G Iq. 

Proof. Let So G Raq, Sq : Iq ^ A such that f*{Xo) = /(’So(Ao))- Then 
f* (Ao) = / (so (Ao)) > f{s (Ao)), Vs G Raq, s : R A. Since / is contin¬ 
uous and the set Raq is hnite, it follows that we can restrict the neighbor¬ 
hood Jo such that / (sq (A)) > /(s(A)), VA G Jo, Vs G Raq, i-e., /* (A) = 
/(so(A)),VAgRao. □ 
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1.3 The meaning of Lagrange multiplier 

In our mostly geometrical discussion, A is just an artificial variable that lets 
us compare the directions of the gradients without worrying about their mag¬ 
nitudes. To express mathematically the meaning of the multiplier, write the 
constraint in the form g{x) = c for some constant c. This is mathematically 
equivalent to our usual g{x) = 0, but allows us to easily describe a whole 
family of constraints. For any given value of c, we can use Lagrange multipli¬ 
ers to hud the optimal value of f{x) and the point where it occurs. Call that 
optimal value /*, occurring at coordinates Xq and with Lagrange multiplier 
Aq. The answers we get will all depend on what value we used for c in the 
constraint, so we can think of these as functions of c : /*(c), a;o(c), Ao(c) . Of 
course, f{x) only depends on c because the optimal coordinates xq depend 
on c: we could write it as /*(c). 

To hud how the optimal value changes when we change the constraint, 
just take the derivative 


df* _ ^ ^ - V/ — 
dc dxl dc * dc 

Use the equation of critical points to substitute V/* = —Aq Vg'o and obtain 

df* . ^ dxo dgo 

— — —Ao Vgo ■ — "-^0 -y- ■ 

dc dc dc 

But the constraint function g^ = g{xQ{c)) is always equal to c, so dg^/dc = 1. 
Thus, df^/dc = —Aq. That is, the Lagrange multiplier is the rate of change 
of the optimal value with respect to changes in the constraint. 

Of course, /* depends on c through of A, and then We can 

dehne c(A) by Cauchy problem 


dc d\ dc 


(EC) 


dc Id/* 


Then another Lagrange dual function may be 

(LDF) if{X) = Mxo{X)) + Xc{X) . 

Proposition If optimum points are critical points, both Lagrange dual 
functions give the same solution. Hence strong duality holds. 
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Proof Indeed, using (EC) we have 




and (/^'(A) = 0 implies c(A) = 0, that is for Aq. 

Often the Lagrange multiplier have an interpretation as some quantity of 
interest: 

(i) A is the rate of change of the quantity being optimized as a function 
of the constraint variable since = A; 

(ii) by the envelope theorem the optimal value of a Lagrange multiplier 
has an interpretation as the marginal effect of the corresponding constraint 
constant upon the optimal attainable value of the original objective function: 
if we denote values at the optimum with an asterisk, then it can be shown 


that 


|/. = |/Wc)) = A.. 


For details regarding classical theory of programs see n. 0. a. 

If we have more constraints ga{x) = Ca,a = 1,...,m, then the Lagrange 
function is L{x, A) = f{x) + Aa(fi'a(ax) — Ca) and the system of critical 

points is 


dx"^ 


a=l 


dga 

dx^ 


= 0 . 


Because the optimal coordinates Xq and the optimal value /* depend on 
vector c, taking the derivatives we have 


r\ p p 0-7 ^ 0 ITL 

dCa OXq dCa ^ ^ dXQ dCa ^ ^ dCa ^ ^ 


Then we can dehne c(A) by the partial differential system, with initial con¬ 
dition, written in matrix language as 


(EC) 


■ dci 
d\i 

dci ■ 
dXm 


'df^ 

df* ' 

dCm 

dCm 


dXi 

Qa 

>- 

1_ 

dXi 

dXm J 





c(Ao) = 0. 


Then another Lagrange dual function may be 


{LDF) (^(A) = /*(xo(A)) + A„ c„(A). 

a=l 
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Deriving the /3-th equation with respect to and the a-th equation with 
respect to A/? , in the previous system, we obtain the complete integrability 
dc dcR 

“ — ^ (symmetric Jacobian matrix); consequently c(A) is 


conditions 


d\p d\c 


the gradient of a scalar function, namely the Lagrange dual function 

Moreover, the previous square matrix being a symmetrical one, we can 
write the equation {EC) as 


dci 

1- 

O 

dX^ ■ 

■■ dX^ 

dci 

*9 Cm 

. (9A™ 

(9A™ . 


' < 9 /, df^ ' 
dy d\^ 


Consequently, in a regular case, we have the following situation: Solv¬ 
ing a constrained optimum problem we obtain the optimal value as /* = 
/(ci,..., Cm)- For the dual problem we use a /* = /(A^,..., A™'). If the cor¬ 
respondence between (ci, ...,Cm) and (A^,..., A™') is like a change of variables 
there hold the relations: 


grad, /* = -A ; c'(A) A = -gradx f, , c'(A) e L{R^, B!^). 


1.4 Examples and counter-examples 

(1) Let us consider the functions f{x, y) = and g{x, y) = x"^ -\-y'^ — 2x 

and the problem 

min f{x,y) constrained by g{x,y) = c, c> —1. 

The Lagrange function of this problem is 

L{x, y, X) = x"^ + y'^ + X{x‘^ + y‘^ — 2x — c). 

The critical points of the partial function {x, y) —)■ L{x, y, A) are the solutions 
of the system 


IdL 
2 dx 


X + Xx — X 


0 

^ 2 dy 


y + Xy = 0. 


It follows X = y = 0 and hence /* = 
restriction, in critical points, we have the relation 

2 

- 2 - 


A 


A 


A + 1 


On the other hand, by 


A + 1 


= c. 




Figure 1: Geometry of Lagrange duality 


It follows 

df* _ df^ 1 
dc dX ^ 

d\ 

and finally, we obtain the geometrical interpretation ^ = —A. 
The dual function is 

V’(A) = L(x(A), ?/(A), A) = - Ac. 


The value ij{X) is a minimum for A > —1 and a maximum for A < —1, in 
the initial problem. The condition of extremum (critical point), 'ip'{X) = 0, is 
equivalent to (c + 1)(A + 1)^ = 1 and the dual problem has the same solution 
as the primal one. 

On the other hand the equation {EC) for this problem is 


dc 


1 d / A y 

A^ VA + iy ’ 


c(Ao) = 0 , where Aq + 1 = ± . :. 

VC + 1 


We hnd c(A) 


1 

(A + l? 


— 1 — c and the dual Lagrange function 





as the above one. 
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(2) A problem with two constraints Solve the following constrained 
optimum problem: 

f{x, y, z) = xyz = extremum 

constrained by 

gi{x,y,z) = x + y - a = 0, g 2 {x, y, z) = xz + yz - b = 0 . 

The Lagrange function is 


L{x, y, z, A, y) = xyz + \{x + y — a) + y{xz + yz — b) 
and the feasible solution of the problem is, only, 

a b \ b a 

x = y=^ = , z = - = - , X = , y = , 

2 a /i 4 4 

/* = — = 4A/r . 

The Lagrange dual function is y) = —4A/i — aX — by . 

The partial differential system which dehnes ci(A,/r) and C 2 {X, y) be¬ 
comes, in this case, 

(EC) 


[X y] 


■ dci 

dci 


dy 

dC2 

dC2 

. 

dy 


= -[Ay AX ], Cl 


b a 
’4’~4 


= C2 


■-,-“ 1 = 0 . 


Oci dc2 

Taking into account that , we obtain two quasilinear PDEs 

oy oX 

dci dci _ dc2 dc2 _ 


with solutions, respectively 


A 


y 


Cl (A, y) — —Ay -|- a ( — , C 2 {X, y) — —AX -f- /3 ( — , 


y 


a , /3 arbitrary functions. The condition is verihed, for instance, if 

oy oX 

a and /3 are constant functions. Using the initial conditions, we hnd hnally 


ci(A, y) = —Ay — a , C 2 (A, y) = —AX — b , 
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(p(A, ju) = 4Aju + A(—4/i — a) + ju(—4A — b) = jj) . 


The Geometry of Lagrange duality is represented in Figure 1. 

(3) A strange problem Solve the following constrained optimum prob¬ 
lem: 


f{x, y, z) = xyz = extremum 


constrained by 

gi{x,y,z) = X + y + z - a = 0 , 
g 2 {x, y, z) = xy + xz + yz — b = 0 . 
So the Lagrange function is 


L{x, y, z, A, /i) = xyz + \{x + y + z — a) + y{xy + xz + yz — b) 


and one from the solutions of the problem is, for instance, 

—a — ^/o? — ?)b 2a^ — 3& -|- 2a^/a‘^ — 3b n 

F=-^-, A =-^-=/i , 

a + \/a^ — ?)b a — 2\^a‘^ — 3b 

x = y= -^-= -h, ^ ^-= a + 2/i , 

with the extremum value 

f* = ^ (-2a^ + 9a6 - 2{a^ - 3bf^‘^) = A(a + 2y ), 
only if a? — 3b > 0. 

Remark Another solution of the problem is 

—a -|- y/a? — 3b 

y = ---,... and so on 

o 


with the extremum value 

r = ^ {-2a^ + 9ab + 2{a^ - 3bf/‘^) = A(a + 2y). 


The interval [/*, /*] solves the following algebraic problem: Find the real 
numbers m such that the equation t^ — at^ + bt — m = 9, a,b E R, has tree 
real roots. 
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It is easily to verify that 

d d 

—f^{a,b) = -A and • 

On the other hand, ^. = 0 and /* cannot be expressed as function 
0 {a, b) 

of A and fx only. Then we have to consider /* = f^{a,b, \{a,b), fj,{a,b)) and 
the following relations 

, _ p . _ <9/* <9/* <9A df^ dfi 

oa oX oa Ofi aa 

^ ^ * db d\ db db 

which is easily to verify also (here D, is an operator of total derivative.) 
Question Which is the dual Lagrange function ip{X,fx) in this case? 
Solving the system of the critical points with respect to x,y and 2 : we 
hnd, for instance, x = y = —y, z undeterminate and A = With these, 
one obtains the dual Lagrange function 

jj) = x(/i) = —jj?" — af/ — by. 

Remark Although is undeterminate, the dual Lagrange function does 

dL 

not depend upon 2 :, because with the above solutions the derivative van- 

oz 

ishes identically. The critical points condition for the dual Lagrange function 

^ = -(3^2 ^2a/i + &) = 0 
ay 

gives us the same solutions as in primal problem. 

Open problem How it means and how we hnd the functions ci and C 2 

A jjX 

in the situation, like this, when ’ = 0 ? 

0 {a, b) 

(4) Let us consider the functions f{x,y) = x"^ + y"^ and g{x,y) = x + y, 
with {x,y) G R^, and the problem 

min f{x,y) constrained by g{x,y) > 1. 

The Lagrange function is 

L{x, y, \) = x'^ + y"^ + \{1 - X - y), {x, y) e R^, \> 0. 
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The function (x, y) —)■ L{x,y,\) is convex. Consequently, it is minimal iff 

dL _ dL _ 
dx dy 

This holds if x = ^, y = ^. Substitution gives 

^(A) = A - y, A > 0. 

The dual problem max'^(A) has the optimal point A = 1. Consequently, 
X = y = ^ is the optimal solution of the original (primal) problem. In both 
cases the optimal value equals i.e., at optimality the duality gap is zero! 
(5) Let us solve the program 

minx subject to < 0, x G R. 


This program is not Slater regular. On the other hand, we have 


^/>(A) 


inf (x + Ax^) = 

x&R 


-ix for A>0 
—oo for A = 0. 


Obviously, 'f{\) < 0 for all A > 0. Consequently, sup{'^(A) | A > 0} = 0. 
So the Lagrange-dual has the same optimal value as the primal problem. In 
spite of the lack of Slater regularity there is no duality gap. 

(6) (Example with positive duality gap) We consider the program 

mine“^ subject to a/x^~+^ — x < 0, {x,y) G R^. 


Here the feasible region is O = {(x, y) E R^ \ x > 0,y = 0}. Consequently this 
program is not Slater regular. The optimal value of the objective function is 
1. The Lagrange function is 

L(x, y, A) = e~^ + X{^/x"^ + — x). 


The Lagrange dual program can be written in the form 


sup'^(A) subject to A > 0. 

Note that L{x,y,\) > 0 implies > 0. Now let e > 0. Fixing y = — In e 
and X = ^ ff , we hnd x/x"^ + y"^ — x = e and 

L{x,y,\) = (1 -h A)e. 
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In this way, 


^ (1 + = 0 - 
{x^y)^R^ e>0 

On the other hand, we also have 'ip{X) > 0, and consequently the optimal 
value of the Lagrange dual program is 0, and hence the minimal duality gap 
equals 1! (No strong duality here). 

1.5 The Wolfe-dual problem 

The Lagrange dual program can be written in the form 

m 

sup { inf {/(x) + V 

A>0 ^6^ 

Assume that D = and the functions f,gi,...,gjn are continuously 
differentiable and convex. For a given A > 0 the inner minimization problem 
is convex, and we can use the fact that the inhmum is attained if and only if 
the gradient with respect to x is zero. 

Definition 2. The problem 

m 

iWP) sup {f{x) + V XaQaix)} 

x,A —f 

subjeet to 

Q:=l 

is called the Wolfe dual of the program (P). 

Obviously, the constraints in Wolfe dual are usually nonlinear. In such 
cases the Wolfe-dual is not a convex program. 

The Wolfe dual has the weak duality property. 

Theorem 3. (weak dnality property) Suppose that D = i?" and the 
functions f,gi,...,gm are continuously differentiable and convex. If x is a 
feasible solution of (P) and {x, A) is a feasible solution for (WP), then 

L{x,\) < f{x). 

In other words, weak duality holds for (P) and (WP). 
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1.6 Example 

(1) Let us consider the convex program 

min X + subject to 3a; — 2e^ > 10, ?/ > 0, (a;, y) G . 

x,y 

Then the optimal value is 5 with x = 4,y = 0. The Wolfe dual of this 
program is 

sup {a; + + Ai(10 — 3a; + 2e^) — A 2 I/} 

x,y,\ 

subject to 

1 - 3Ai = 0, + 2ey\i - A 2 = 0, (a;, y) G A > 0. 

Obviously, the Wolfe dual program is not convex. It follows Ai = 1 and 
the second constraint becomes |e^ — A 2 = 0. Eliminating Ai, A 2 from the 
objective function, we hnd 

9 {y) = 3 ^ 3 + y- 

This function has a maximum when g'{y) = 0, i.e., y = 0 and /(O) = 5. 
Hence the optimal value of (WP) is 5 and then {x,y, Ai, A 2 ) = (4, 0,1, |). 
Remark The substitntion z = > 1 makes the problem linear. 


2 Minimax inequality 

For any fnnction 0 of two vector variables x & X, y E Y, the minimax 
inequality 

max min (hix, y) < min max (h(x, y) 

y&Y x&X - xex yGY 

is true. Indeed, start from 

\/x,y : min0(a;',|/) < max0(a;, ?/') 

x'aX y'&Y 

and take the minimum over a; G X on the right-hand side, then the maximnm 
over y eY on the left-hand side. 

Weak dnality is a direct consequence of the minimax inequality. To see 
this, start from the unconstrained formulation of Lagrange, and apply the 
above inequality, with 0 = L the Lagrangian of the original problem, and 
y = X the Lagrange vector multiplier. 
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3 Nonholonomic Lagrange and Wolfe 
dual programs 

The Pfaff nonholonomic constraints in optimal programs were introduced by 
the mathematical school coordinated by Prof. Dr. Constantin Udri§te at 
University Politehnica of Bucharest (see 0, 0, IIDI-IZBI). 

In this section we address the following original issues: (i) difference be¬ 
tween a program constrained by an integral submanifold of a Pfaff equa¬ 
tion and a program constrained by a Pfaff equation; (ii) the non-holonomic 
Lagrange-dual problem with weak respectively strong duality; (iii) the non¬ 
holonomic Wolfe-dual problem; (iv) pertinent examples. 

Let 

oj = Ui{x)dx^ = 0, i = 1, x G i?", n > 3, 

be a non-completely integrable Pfaff equation (see also, [2], |H], 0, [S])- The 
condition n > 3 is imposed by the nonholonomy theory. For theoretical 
reasons, we understand that the co-vector field u = {u}i{x)) is on K^, and 
has no critical point in i?”. To this Pfaff equation we attach the (n — 1)- 
hyperplane 

= {ui{x)dx^ = 0} = {?/ = ..., ?/") G \ uji{x)y^ = 0} 

in the n-space i?”. 

Definition 3. (i) A submersion g = {ga), a = 1, ■■■,n — p, is called solution 
of the Pfaff equation u = Ui{x)dx^ = 0 if gi{x) = 0,gn-p{x) = 0 and 
dgi{x) = 0,..., dgn-p{x) = 0 imply u = Ui{x)dx^ = 0. 

(ii) A p-dimensional submersed submanifold 

{M,uj,g,=) 

of is called an integral manifold of the Pfaff equation on i?” if 
dg{Mf) C ifj., for each pointx inM. 

Definition 4. The Pfaff equation is said to be completely integrable if there 
is one and only one integral manifold of maximum possible dimension n — 1 
through each point of R^. 

Theorem 4. (Frobenius theorem) A necessary and sufficient condition 
for the Pfaff equation uj = 0 to be completely integrable is oj A du = 0. 

Here doo is the differential form of degree 2 obtained from u by exterior 
differentiation, and A is the exterior product. 
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3.1 Integral curves 

Let (T,g = {gi,gn-i),=) be an integral curve of the non-completely in- 
tegrable Pfaff equation u = Ui{x)dx^ = 0 can be written in the Cartesian 
implicit form 

r ; gi{x) = 0 , ...,gn-iix) = 0 , 

i.e., the equations gi{x) = 0,gn-i{x) = 0 and dgi{x) = Q,...,dgn-i = 0 
imply Ui{x)dx^ = 0, for any dx. This means that 


det 


dgi 

dgi 

dx^ 

dx" 

dgn-i 

dgn-i 


1 dx^ ■ ■ ■ 

V tUi ... Un 


\ 

/ 


0 . 


It follows that there exist the functions ui{x), ...,Un-i{x), 
fx, such that 


n—1 




Q:=l 


dx^ 


and the constant 


for any x. 


3.2 The connection between critical points on 
an integral submanifold and critical points 
with Pfaff non-holonomic constraint 

Let / : i?" —)■ i? be a function. Let us consider a non-holonomic program; 

{NP) min/(x) subject to ca = = 0, i = 

Suppose that a p-dimensional integral submanifold of the Pfaff equation 
u = uji{x)dx'‘ = 0 is {M,u,g = {gi,gn-p),=). We consider the attached 
program 

min/(a;) subject to x E M 

X 

or, equivalently, 

min/(a;) subject to gi{x) = 0,gn-p{x) = 0. 
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Theorem 5. A point in i?” is a nonholonomic constrained critical point 
if and only if it is critical point constrained by each integral submanifold 
containing it. 

Proof (if) For a given integral submanifold, the associated Lagrange 
function is 

L{x, A) = f{x) + \igi{x) + ... + \n-pgn-p{x). 

The critical point conditions are 

df{x) + \idgi{x) + ... + \n-pdgn-p{x) = 0, Vda;; gi{x) = 0,..., gn-p{x) = 0. 

Let X be a vector field tangent to the integral submanifold (and hence 
from distribution), i.e., 

dgi{X) = 0, ...,dgn-p{X) = 0, uj{X) = 0. 

It follows that at a critical point we must have df{X) = 0. 

If the integral submanifold is arbitrary (both as dimension and as way of 
description), i.e., g is arbitrary, i.e., X is an arbitrary vector held tangent 
to M, then from the relations u}{X) = 0,df{X) = 0,VX, we obtain the 
existence of a constant fx such that, at a critical point, which is independent 
on g, we must have 

(NCP) df{x) + nuj{x) = 0, ydx. 

(only if) Each nonholonomic constrained critical point belongs to each 
critical point set associated to a constrained integral submanifold. Indeed, if 
X is a vector held tangent to an integral submanifold, then 

dgi{X) = 0, ...,dgn-p{X) = 0, u{X) = 0. 

It follows the existence of multipliers Ai, ..., \n-p such that 

u Xidgii^x^ T ... T \n—pdgn—p{xf 


But df = —pajj. 
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3.2.1 Attached Riemannian geometry 


Let us consider the system ”NCP = nonholonomic critical point”, where the 
parameter fi is arbitrary. According to the implicit function theorem, if the 
matrix 

< 92 / dUi 


(O-ij) — 


dx^dx^ ^ ^dx^ 


is non-degenerate at a hxed critical point, then the system dehne a curve 
X = On the other hand, the matrix of elements Qjk = S^’‘aijaki is a 

Riemannian metric. Symbolically, a = (ajj), g = ^aa, g~^ = and 

the geometry induced by g follows by usual rules. 

By differentiation with respect to /i, we obtain 


dx^ 

dfi 


Q-ij - -h CJj — 0. 


Let rjk = Then gkj^ + hfc = 0 hence 

dx^ dx^ dx^ 
dkj-r:— + = 0 


djj, dg, 


dg 


Proposition The angle between the vectors gk and ^ is always obtuse. 

Completion It follows 


d‘^f g f duji duj 


. T -h - 

dx^dx^ 2 


dx^ dx^ 


dx^ dx^ dx^ ^ 
dg dg * dg 


We reinterpret this equality, introducing the fundamental tensor 


, an 

dx^dx^ 


2 


dui du. 


dx^ dx^ 


and writing 


, dx^ dx^ dx^ 

hij —— -h = 0. 

dg dg dg 


3.3 Eliminating ambiguities by 
geometric interpretation 

The Pfaff equations theory is a source of misunderstanding for beginners. 
We can eliminate such problems thinking in terms of differential geometry. 
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3.3.1 Language of Vranceanu 

Let be the image of an integral cnrve of the Pfaff eqnation Ui{x)dx^ = 0 
throngh the point xq. Denote by I^xo = {Ixq} the family of all images of 
integral cnrves throngh the point Xq. The pair 

(D, S), S = {Sa-g I Xq G D} 

is called nonholonomic hypersurface on D attached to the Pfaff eqnation 
0 Ji{x)dx^ = 0 (see also, [29]). 

To the Pfaff eqnation oji{x)dx^ = 0 and to the point xq G -D, we attach 
the nniqne (n — l)-hyperplane 

Hxo = {x G -R" I a;i(xo)(x* - Xq) = 0}. 

Since all straight lines tangent to integral cnrves which pass throngh xq are 
inclnded in H^q, the hyperplane is called the tangent hyperplane at Xq 
of (D,E). 

3.3.2 Language of distributions 

Let ns introdnce the (n — l)-hyperplane 

Hx = {ui{x)dx^ = 0} = {?/ = (?/\?/") G I uji{x)y^ = 0} 

in the n-space R"'. The rnle x —)■ Hx gives a held H of hyperplanes in RJ^, 
or what we call (n — 1)-dimensional distribution: a linear subbundle of the 
tangent bundle. In short H = Uxf^RnHx C TR^. In snbseqnent explanations, 
we shall prefer the distribntions language being more suggestive. 

Similarly, we can introduce the half-hyperplane 

= {a;i(x)dx* < 0} = {?/ = (|/\..., y^) G R^ \ Ui{x)y^ < 0}. 

The rule x —>■ H~ gives a held H~ called {—)- distribution. 

Definition 5. A p-dimensional submersed submanifold {M,H,g,=) of R^ 
is called an integral manifold of the distribution H on R^ if 

dg{Mx) C H{g{x)) = Hx, for each pointx inM. 

The held H~ is used when the program refers to manifolds whose bound¬ 
ary contains an integral manifold of a Pfaff eguation. A manifold with bound¬ 
ary is a manifold with an edge. The boundary of a (p -|- l)-manifold with 
boundary is a p-manifold. In technical language, a manifold with boundary 
is a space containing both interior points and boundary points. 
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3.4 The Lagrange dual problem 

Let {M,u,g = {gi, ...,gn-p),=) be a p-dimensional integral submanifold of 
the Pfaff equation u = Ui{x)dx^ = 0 and / : i?” —)■ i? be a function. 
Denote g = {ga), a = 1, ...,n — p, and we introduce the set 

= {x G \ ga{x) = 0, a = 1,n — p} = {x G i?” I g{x) = 0}. 


For each program 

min/(a;) subject to a; G M, 

X 

equivalently 

min{/(a;) | x G D}, 

X 

we can repeat the theory of dual programs. The Lagrange function (or La- 
grangian) of this program is 


n—p 

L{x, A) = f{x) + ^ Kga{x) = f{x)+ < \,g >,x e R'^,\ e R^~^. 

cx=l 

The critical points with respect to the variable x are given by the system 
(1) df{x)+<X,dg{x)>=0,ydx. 

It follows X = x{\), the dual function = L{x{\),\) and the Lagrange 
dual problem 

max'^(A). 

A 

But, what we understand by the dual theory for the nonholonomic pro¬ 
gram (NP)? Of course, we must ask an arbitrary integral submanifold. That 
is why, the system (1) must be replace with the system 

df{x) -|- pu{x) = 0,\/dx. 

The solution of this system is of the form x = x{p), an arbitrary dual function 
is A) = L{x{p,), A) and the Lagrange dual problem can be written 

max'^(/i. A). 

/i, A 

Each Lagrange function L{x, A) of (NP) is linear in A. 

If the distribution H is described by q Pfaff equations, then A has {n—p)q 
components and p has q components. 


21 



3.4.1 Dual nonholonomic Lagrange function 


In the non-holonomic context, the constraint function g does not exists, but 
it should be built at least on the critical point set {a;(/r)}. 

By analogy with the holonomic equality g{x{g)) = c, from the relation 

^ — “h) we can define c(/i) by Cauchy problem 

djj, \dfiJ 


{EC) 


dc Id/* / X n 

’ cm = ^ ’ 

djj, /i djj, 


and then a Lagrange dual function will be 


{LDF) =/*(a;(/i))+/ic(/r). 


This function has the derivative 6'{g) = c{g). 

Example 1. Let the objective function be f{x,y,z) = x"^ + y'^ + 
and the constraint Pfaff form u = xdy + dz = 0. Then the critical points 
condition df + yu = 0 gives us 

2 

x{y) =0, y{y) = 0, z{y) = f^{x,y,z) = 

If, for instance, we take yo = 2, the solution for the primal problem will be 
X = 0,y = 0,z = —1 and /* = 1. 

For the Lagrange dual problem the equation (EC) gives us 


dc 1 y 1 

dy y 2 2 


Consequently c{y) = — ^ + a . If, as instance, cq = 0 for yo 
and the Lagrange dual function is 




y 


0iL) = ^+L =-^+L 




2, then a 


1 


It follows d'{y) = —^ + 1 


0. Hence yo = 2 and we obtain the same solution 


as in primal problem. 

Example 2. Let consider the function f{x,y,z) = x^ + y"^ — z and the 
Pfaff form u = xdy — zdz. Solve the primal problem 


f{x,y,z) = extremum with constraint a; = 0. 
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The differential Lagrange form of the problem is 


dL = 2xdx + 2ydy — dz + y{xdy — zdz). 


The condition dL = 0, (as differential form) leads to the equations 2x = 
0, 2y + yx = 0, —1 — yz = 0, whose solutions, the critical points, are x = 
0, 1 / = 0, = — l//i. For /r < 0 the critical points are points of constrained 

minimum and the corresponding minimum values are /* = 1/y. For /x > 0 
the critical points are not constrained extremum points. 

For construct the dual problem, let us use the above described method. 

From equation (EC) we have dc/dy = l/y^. Then c{y) = 

and the Lagrange dual function will be 




Lo 


9{y) 


y 2 \y^ yl) 


The critical points are given by the equation 


e\y) 



c{y) = 0 


and a solution is yo, i.e. the strong duality holds. 

Another way Remind that a point {xo,yo,zo) is a minimum (maximum) 
point for the function f[x,y,z), constrained by the Pfaff equation a; = 0, if 
this point is a minimum (maximum) point for / restricted at any line solution 
of a; = 0, passing through (xq, yo, Zq) . So we reformulate the primal problem 
for a suitable line passing through critical points, in our case (0,0,1/c). Such 
a line has the cartesian implicit equations 


f y"^ — z — l/c = 0 
\ 2yz — a; = 0 . 


Then the Lagrange function of the problem is 

L{x, y, z, \, y) = x"^ + y"^ — z + — z — 1/c) + y{2yz — x). 


We obtain the following system of the critical points: 


dx 


2x — y = D 
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2y + 2Xy + 2yz = 0 


dL 

dy 


dL 

— - -1 - +2yy 
This system has the solution (0, 0,1/c) for y ■ 


0 . 

0 and A 


1 . 


3.5 Case of nonholonomic inequalities 

Let {K^,u,g = {gi, ...,gn-p), <, ^) be a subset of i?”, attached to the Pfaff 
inequation u = Ui{x)dx^ < 0 , whose boundary contains the p-dimensional 
integral submanifold (M, uj,g = {gi, ...,gn-p), =, =) of the Pfaff equation. 

Let be a function. Denote g = (go), a = 1, ■■■,n — p, and 

we introduce the set 


Q = {x E \ ga{x) < 0, a = 1, ...,n — p} = {x E \ g{x) ^ 0 }. 


Theorem 6 . (Aveak duality^ The dual function yields lower bounds of the 
initial optimal value /*, i.e., for any \, we have 99 (A) < /*. In other words, 


sup { 99 (A) I A ^ 0} < min {f{x)+ < A, g{x) >, x eLL, A ^ 0}. 

Theorem 7. (strong duality) If the program (P) satisfies the Slater con¬ 
dition and has finite optimal value, then 


sup { 99 (A) I A ^ 0} 

A 


min 


{/(^)}- 


Moreover, then the dual optimal value is attained. 


3.6 Nonholonomic Wolfe dual 

The problem 

max {f{x)} 

X, fL 

subject to 

Of 

^(x) + yui(x) = 0, /i > 0 

is called the nonholonomic Wolfe dual (WDNP) of the nonholonomic program 
(NP). 

It follows X = x{y) and f{x{fj,)). That is why, solving the dual problem 
is equivalent to hnd extrema of the function y -E f{x{y)). 
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3.7 Examples 

(1) see [28], p.190-191 (Consumer theory with nonholonomic con¬ 
straint) Suppose that 

n 

u{x) = • • • a;“", a* > 0, a* < 1, x = {xi,Xn) G -R" 

i=l 

is the utility function defined over n goods. Denote hj Pi{x) > 0, i = 1, 
the prices of the goods. 

Let us determine what is the proportion of income that the associated 
consumer will spend on each good, if the budget constraint is the Pfaff in¬ 
equality Pi{^)dxi > 0. 

The utility function is concave. We must look for critical points of the 
utility function u subject to the given nonholonomic constraint. To determine 
the constrained critical points, we use the Lagrange 1-form 

n ^ / \ 

1 ] = du{x) — fi Pi{x)dxi = ( — u{x) — fipi{x) I dxi, 

and we write the system 

aiu{x) — ppi{x)xi = 0, i = 1, 

Suppose we have a critical point (solution) x* = x*{p), p > 0. Each 
component X* of the critical point is the quantity consumed of the good. 
Moreover, pi[x*)x* is the income spend on the good and is 

the total income. Since 

n n 

u{x)'^ai - p'^pi{x*)x* = 0 , 
i=l i=l 

we hnd the proportion of the income spent on the good, 

Pi{x*)x* _ at 

which is independent on consumed quantities and of prices [economic law). 

Suppose we are interested in the maximum of the utility function u sub¬ 
ject to the nonholonomic constraint. To solve this problem, as usual we look 
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for a critical point x* = x*{fi), fi > 0, which verify the equality in the budget 
constraint (hyperplane) = 0 and the negative dehniteness 

of the restriction of the quadratic form 



to the budget hyperplane. 

(2) Nonholonomic initial program Find extremum points of the func¬ 
tion /(x, y, z) = 2xy + subject to zdx — dy > 0, xdy + dz > 0 (see [T7]h 
The constrained critical points are solutions of the system 

2y — niz = 0, 2x + Hi — /i 2 X = 0, 2z — ^2 = 0, pi <0, /i 2 < 0. 

It follows the family of critical points 



The nature of each critical point is hxed by the signature of the quadratic 


form {4:—y 2 )dxdy—yidxdz+2dz‘^ restricted to ^dx—dy = 0, j^^dy+dz = 0. 

It follows the restriction q = ^2 + 2 — dx'^, which is negative 

dehnite. All critical points are maximum points. The manifold of critical 
points has the implicit Cartesian equation 

xz{z — 1) = y, X > 0,y > 0, z < 0. 

The maximum value of the function / is 



If we change the constraints into 

zdx — dy < 0, xdy + dz < 0, 

then it will be sufficient to have positive multipliers and a positive dehnite 
quadratic form q in order that each critical point becomes a minimum point. 
The PDFs system which gives us Ci(/ii,/r), C 2 (/ri,/i 2 ) is (see 1.2 ) 
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2 


dC2 dC2 1 

/^1 - 1 “ /^2 7 ^- = /^2 + 


a/il ^^^2 
with solutions, respectively, 

Ci(/ii, ^ 2 ) = ~hl 


/^i 


h2 ~ 2 


2 /ii 

h2 


ln|/X2-2|+«i ^ , 


Hi 


f^2 


I N 1 /^? 

C2(/il,h2) - - 77 /i 2 - -2 \ 9 

/ H 2 \ 1^2 ~ ^ 


-In|/r2-2|)+a2(^) , 


/^i 


where cti, a 2 are arbitrary functions. The condition is verihed. 


9/i2 9/^1 


/^i 


/^2 


for instance, if ai — = ct. = ai and 02 — =- 2 + <^ 2 , «2 = ct. 


/^i 


/^2 




^^2 


Finally, the Lagrange dual function 0{hi, H 2 ) = f* + Hi^i + ^^ 2 C 2 is 

fJj^ 1 jJj^ 

diHi^ /^ 2 ) = —^ - Tfi 2 - — In |/r 2 - 2| + ai/ii + Q! 2 /i 2 • 

/ 4 /i2 

Nonholonomic Wolfe dual program Find the extrema of the function 
(/^i, /^2) -t (fiHi, h2) = f{xiHu /^2), /i2), /n2)), /ii <0, H2< 0. 


Since 


V?(/il,h2) = 


/i?/i2 


2(/i2 - 2) 4 


+ 7/^2 — O5 


in the conditions of the problem, the critical point /xi = 0,/r 2 = 0 is a min¬ 
imum point. Also, all the points of the form [fn < 0,^2 = 0) are minimum 
points. 


4 Extrema in canonical coordinates 

The next Theorem, which gives the canonical Pfaff forms (and hence canon¬ 
ical nonholonomic constraints), is named after Jean Gaston Darboux who 
established it as the solution of the Pfaff problem (see 0 . 0 . psi). 

Theorem 8. (Darboux Theorem^) Suppose that u is a differential 1- 
form on an n dimensional manifold, such that du has constant rank p. 
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If u A {duY = 0 everywhere, then there is a local system of coordinates 
x'P, y^,y'P in which 


p 

CO = ^ x^dyh 

i=l 

If, on the other hand, u A {duY Y 0 everywhere, then there is a local system 
of coordinates x^,...,x^, y^,...,y^,z in which 

p 

u = '^^x'^dy'^ + dz (contact form) 

i=l 


or 


UJ = 


— '^^y'^dx'^ \ + dz (symmetric normal form). 


. i=l 


i=l 


From the normal form of Darboux, we see that the maximal integral 
manifolds are of dimension p. For the contact form equation 


u = 


^ x^dy^ + dz = 0, 


i=l 


they are given by 

^ = f{y\-;y^), 

where / is a arbitrary function. 






dyP' 


4.1 Case of even number of variables 

Let us hnd the extrema of a function f{x,y),x = {x^),y = {y^),i = 
subject to a nonholonomic constraint written as Pfaff equation u = x^dy^ + 
... + x'^dy"' = 0 . 

The constrained critical points are solutions of the system 


dx^ 


0 , 


dy^ 
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4.2 Case of odd number of variables 


Let us find the extrema of a function f{x, y,z),x = {x^),y = {y^),i = 1, •••, n 
subject to a nonholonomic constraint u = x^dy^ + ... + x^dy^ + dz = 0. The 
constrained critical points are solutions of the system 


K 

dx^ 


0 , 


dy^ 


+ yx^ 


n 


0 . 


To decide the type of a critical point (xq, i/o; 2 : 0 ), we use the restriction 
of the quadratic form Q = d‘^f{xo,yo,zo) + ydijdx^dy^ to the hyperplane 
dijX^dy^ + dz = 0 and its signature. Since 


d^f = 


ay 

dx^dx^ 


dx'^dx^ + 


ay 

dy^dy- 


-dy^dy^ + 2 


ay 

dx^dy- 


-dx^dy^ 


+2 

the restriction of Q is 

S‘f 


ay , ay y , ay_,, 


dx^dz 


dx^ + 


dy^dz 


dy'- ) dz + , 


Q = 


dx^dx^ 


-dx^dx^ + ^ ,l ■ dy''dy^ + 2 ^ .1^ . dx^dy^ + ySijdx^dy^ 


dy^dy^ 


dx^dy^ 


-2 


d^f 

dx^dz 


d^f 


d^f, 


) dkiXody’^ + -^{dhix^dy^) 


dy^dz 


/ 


4.2.1 Another point of view 

The general solution of the Pfaff equation u = x^dy^ + ... + x'^dy^ + dz 

is 


z = (p{y), X = — ^(y), where cp is arbitrary. 
Consequently the previous nonholonomic program can be written 


min f{x,y,z) subject to z = <p{y), x 


dip 

dy 


{y)- 


0 


In this form, it is similar to a classical program, but the function ip is arbi¬ 
trary. For each ip, we attach a Lagrangian 


L{x, y, z, Ai, A 2 ) = f{x, y, z) + Xi{z 


^{y)) + A 2 
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It follows the system which describes the critical points 


dL_df _ dL 

“ b-H A2 — U, — 

OX OX Oz 


dl 

dz 


+ Ai = 


0 


dL 

dXi 


dL df .dip ^ 

“ b-"^^b—^ "^2 

Oy Oy Oy 


= z- piy) = 0, 


dL 

d\2 


d'^p 

dydy 

dp 

^ + b“ 
dy 


0 


{y) = 0 . 


This means that the critical points with respect to x, y and must verihes 
the constraints of the initial program. 

In this context, it is very clear what means a Lagrange dual program. 
The dual function is 


A2) — L(a;(Ai, A2), |/(Ai, A2), ^(Ai, A2), Ai, A2). 


Example Find extremum points of the function f{x, y, z) = x + y + z + 
+ y"^ + z^) subject to dz — xdy = 0. 

The constrained critical points are solutions of the system 


d_l 

dx 



— px 



+ p 


0 , 


i.e., X = l,y = 1 — p, z = 1 + p. 

The nature of critical points is determined by the signature of the restric¬ 
tion q of the quadratic form Q = — -|- dy"^ + dz"^) — pdxdy to the plane 

dz = dy. It follows q = — (dx^ -|- 2dy‘^) — pdxdy. This quadratic form is 
negative dehnite for /i^ < 8, i.e., p G {—2^/2,2^/2). In this case, all critical 
points are maximum points. 

The function p{p) = f{x{p),y{p), z{p)) = \—p^ is increasing on (—2v^, 0) 
and decreasing on (0, 2\/2). Also, inf p{p) = | — 8. 


4.2.2 Passing to an even number of variables 

The point (x, y, z) belong to the contact manifold dehned hj u = xdy + dz = 
0. Let t be the number to be multiplied with 1-form uj to obtain a point of a 
symplectic (2n -|- 2)-manifold described by the 1-form y = tu = txdy + tdz. 
If we pass to the coordinates P = (P^) = {p,po),p = tx,po = t, Q = {Q^) = 
{d) %))d = V) % = 2 :, then 7] = PdQ and hence dy = dP A dQ. 
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Having in mind the changing of the variables, the function f{x,y,z) be¬ 
comes / = q^{P,Q)- The constrained critical points are solutions 

of the system 


dip 


0 , 


dp 




dp 


0 . 


4.2.3 Contact Hamiltonian 

Let X = + if-§pi + z-^,i = be the contact vector held. Let 

K{x,y,z) be the contact Hamiltonian, which is dehned by 


K{x,y, z) = u{X), X\du\uj=o = dK. 


In case of a; = xdy -|- dz, we have du\oj=o = —dx A dy. It follows the Hamil¬ 
tonian K = xy + z and the contact how 


dK dK dK dK 

^ = y = ^ ^ ~^xr- 

dy dz dx dx 


Open problem Find extrema of the contact Hamiltonian K{x, y, z) con¬ 
strained by ca = xdy + dz = 
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